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We present an efficient family of quantum circuits for a fundamental primitive in quantum infor- 
mation theory, the Schur transform. The Schur transform on n d dimensional quantum systems is a 
transform between a standard computational basis to a labelling related to the representation theory 
of the symmetric and unitary groups. If we desire to implement the Schur transform to an accuracy 
of e, then our circuit construction uses a number of gates which is polynomial in n, d and log(e _1 ). 
The important insights we use to perform this construction are the selection of the appropriate 
subgroup adapted basis and the Wigner-Eckart theorem. Our efficient circuit construction renders 
I ■ numerous protocols in quantum information theory computationally tractable and is an important 

' new efficient quantum circuit family which goes significantly beyond the standard paradigm of the 

\Q , quantum Fourier transform. 

o 
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INTRODUCTION 



The last decade has seen the development and expansion of a robust theory of quantum information [1-6]. 
The basic goal of this new work has been the identification and quantification of different information re- 
sources in situations where the laws of quantum theory are applied to the physical carriers of information. 
Quantum information theory has made great progress in understanding the optimal rates of the manipula- 
tion and transmission of quantum information. Despite this success, however, much of the work in quantum 
information theory may not be of practical value. This is because most of the work in quantum information 
theory has focused on protocols which allow for unbounded quantum computational resources. Thus while 
the transforms in the quantum information protocols are well defined, whether these transforms can be 
implemented with quantum circuits whose size scales efficiently with the size of the quantum information 
problem is often left unaddressed. An analogous situation arises classically, for example, in the theory of 
classical error correcting codes. On the one hand, we would like the classical error correcting code to attain 
some characteristic efficiency for communicating over a noisy channel. On the other hand, we would also 
like to design codes whose encoding and decoding does not significantly lag our communication. In order 
to be of practical value a classical error correcting code must use computational resources which scale at a 
reasonable rate. While the goal of performing classical coding tasks in polynomial or even linear time has 
long been studied, quantum information theory results have typically ignored questions of efficiency. For 
example, random quantum coding results (such as [7-10]) require an exponential number of bits to describe, 
and like classical random coding techniques, do not yield efficient algorithms. There are a few important 
exceptions. Some quantum coding tasks, such as Schumacher compression [3, 11], are essentially equivalent 
to classical circuits, and as such can be performed efficiently on a quantum computer by carefully modifying 
an efficient classical algorithm to run reversibly and to deal properly with ancilla systems [12]. Another 
example, which illustrates some of the challenges involved, is Ref. [13] 's efficient implementation of entan- 
glement concentration[5]. Quantum key distribution[14] not only runs efficiently, but can be implemented 
with entirely, or almost entirely, single-qubit operations and classical computation. Fault tolerant quan- 
tum computing [15] usually seeks to perform error correction with as few gates as possible, although with 
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teleportation-based techniques [16, 17] computational efficiency may not be quite as critical to the threshold 
rate. Finally, some randomized quantum code constructions have been given efficient constructions using 
classical derandomization techniques in [18]. In this paper we present an efficient family of quantum circuits 
for a transform used ubiquitously[19-30] in quantum information protocols, the Schur transform. Our effi- 
cient construction of the Schur transform adds to the above list a powerful new tool for finding algorithms 
that implement quantum communication tasks. 

The Schur transform is a unitary transform on n d-dimensional quantum systems (n qudits). The basis 
change corresponding to the Schur transform goes from a standard computational basis on the n qudits to 
a labelling related to the representation theory of the symmetric and unitary groups; much like the Fourier 
transform, it thus transforms from a local to a more global, collective basis, which captures symmetries of 
the system. In this article we show how to efficiently implement the Schur transform as a quantum circuit. 
The size of the circuit we construct is polynomial in the number of qudits, n, the dimension of the individual 
quantum systems, d, and the log of accuracy to which we implement the transform, log(e _1 ). Our efficient 
quantum circuit for the Schur transform makes possible efficient quantum circuits for numerous quantum 
information tasks: optimal spectrum estimation [19, 20], universal entanglement concentration[22], universal 
compression with optimal overflow exponent [23, 24], encoding into decoherence-free subsystems [26-29], op- 
timal hypothesis testing[25], and quantum and classical communication without shared reference frames[30]. 
The central role of the Schur transform in all of these protocols is due to the fact that the symmetries of 
independent and identically distributed quantum states are naturally treated by the representation theory 
of the symmetric and unitary groups. Thus in addition to making practical these quantum information 
protocols, the Schur transform is an interesting new unitary transformation with an interpretation relating 
to these symmetries. 

There are many difficulties in designing an efficient quantum circuit for the Schur transform which we 
overcome in this paper. The first difficulty is in efficiently representing the basis used in the Schur transform, 
the Schur basis. A second difficulty comes in the actual circuit construction. In particular we would like 
to construct the Schur transform from a series of Clebsch-Gordan transforms. However, it is not at all 
obvious how to efficiently implement these Clebsch-Gordan transforms, nor is obvious that such a cascade 
can perform the complete Schur transform. 

Our resolution to these problems begins by our selection of certain subgroup-adapted bases for the Schur 
basis. In particular we use the Gel'fand-Zetlin basis[31] and the Young- Yamanouchi basis (sometimes called 
Young's orthogonal basis) [32] . We note that subgroup-adapted bases are also used in constructing efficient 
quantum circuits for Fourier transforms over nonabelian finite groups[33]. However, we should emphasize 
that the Schur transform is not a Fourier transform over a finite group, although connections between 
such transforms and the Schur transform exist, and are discussed in part II of this paper. By choosing 
the Gel'fand-Zetlin basis and the Young- Yamanouchi basis, we are able to show that the Schur transform 
can be constructed from a cascade of Clebsch-Gordan transforms. Further, the use of the Gel'fand-Zetlin 
basis, combined with the Wigner-Eckart theorem, allows us to efficiently implement the Clebsch-Gordan 
transform. In particular the Wigner-Eckart theorem allows us to recursively express the d dimensional 
Clebsch-Gordan transform in terms of the d— 1 dimensional Clebsch-Gordan transform and small, efficiently 
implementable, unitary transforms. This produces an efficient recursive construction of the Clebsch-Gordan 
transform. Without the recursive structure we exploit, a naive circuit construction would seem to require 
n O(d ) g a tes. Our recursive exploitation of the Wigner-Eckart theorem allows us to implement the Clebsch- 
Gordan transform to accuracy e using poly(d, logn, log 1/e) gates. The total size of our circuit construction 
for the Schur transform is npoly(cZ, log n, log 1/e). 

The outline of the paper is as follows. In Section II we introduce the Schur transform, along with basic 
concepts from representation theory, and review the numerous applications of the Schur transform in quantum 
information theory. In Section III we introduce the basis labelling scheme used in the Schur transformation 
using the concept of a subgroup-adapted basis. Once we have a concrete Schur basis defined, we describe 
the Clebsch-Gordan transform and explain how to use it to give an efficient circuit for the Schur transform 
in Sec. IV. Finally, we complete the algorithm in Sec. V by constructing an efficient circuit for the Clebsch- 
Gordan transform. 



II. THE SCHUR TRANSFORM AND ITS APPLICATIONS 

Consider a system of n d-dimensional quantum systems: n qudits. Fix a standard computational basis 
\i), i = 1 . . . d for the state space of each qudit: C d . A basis for the system (C 2 )®" is then ® \i 2 ) <£> • • • ® 
\i n ) = \ii, 12, ■ ■ ■ , i n ) where ik = 1 . . . d. The Schur transform is a unitary transform on the standard basis 
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|*i j *2 5 • ■ • j in)- After the Schur transform, the standard computational basis is relabeled as |A) \p)\q) (symbols 
which we later define). In this section we review the basic representation theory necessary to understand 
the Schur basis and also review the applications of this transformation to different protocols in quantum 
information theory. 

A. Representation theory background 

The Schur transform is related to the representations of two groups on (C d )® n , a representation of the 
symmetric group and a representation of the unitary group. We first recall the basics of representation 
theory before introducing these representations. For a more detailed description of representation theory, 
the reader should consult [34] for general facts about group theory and representation theory or [35] for 
representations of Lie groups. See also [36] for a more introductory and informal approach to Lie groups and 
their representations. 

Representations: For a complex vector space V, define End(V) to be set of linear maps from V to itself 
(endomorphisms) . A representation of a group Q is a vector space V together with a homomorphism from Q 
to End(y), i.e. a function R : Q — > End(V) such that R(5i)R(g 2 ) = R(<7i<?2)- If R(<?) is a unitary operator 
for all g, then we say R is a unitary representation. Furthermore, we say a representation (R, V) is finite 
dimensional if V is a finite dimensional vector space. In this paper, we will always consider complex finite 
dimensional, unitary representations and use the generic term 'representation' to refer to complex, finite 
dimensional, unitary representations. Also, when clear from the context, we will denote a representation 
(R, V) simply by the representation space V. 

The reason we consider only complex, finite dimensional, unitary representations is so that we can use 
them in quantum computing. If d = dim V, then a ci-dimensional quantum system can hold a unit vector in 
a representation V. A group element g e Q corresponds to a unitary rotation R(g), which can in principle 
be performed by a quantum computer. 

Homomorphisms: For any two vector spaces V\ and V2, define Hom(Vi, V2) to be the set of linear trans- 
formations from Vi to V2. If Q acts on V\ and V2 with representation matrices Ri and R2 respectively, 
then the canonical action of Q on Hom(Vi, V2) is given by the map from M to R2(g)MRi(g)~ 1 for any 
M e Hom(Vi, V2). For any representation (R, V) define V Q to be the space of (/-invariant vectors of V: i.e. 
V Q := {\v) e V : R(g)\v) = \v)Vg e Q}. Of particular interest is the space Honi(Vi, which can be 

thought of as the linear maps from V\ to V2 which commute with the action of Q. If Hom(Vi, Vi) Q contains 
any invertible maps (or equivalently, any unitary maps) then we say that (Ri, V\) and (R2, V2) are equivalent 
representations and write 

9 

Vi = v 2 . 

This means that there exists a unitary change of basis U : V\ — * V2 such that for any g e Q, UTLi(g)W = 

R 2 (g). 

Dual representations: Recall that the dual of a vector space V is the set of linear maps from V to C and 
is denoted V* . Usually if vectors in V are denoted by kets (e.g. \v}) then vectors in V* are denoted by 
bras (e.g. (v\). If we fix a basis {\v\), \v 2 ), ■ ■ .} for V then the transpose is a linear map from V to V* 
given by \vi) — > (vi\. Now, for a representation (R, V) we can define the dual representation (R*,V*) by 
H*(g)(v*\ := (w*|R(g _1 ). If we think of R* as a representation on V (using the transpose map to relate V 
and V*), then it is given by R*(.g) = (R(g _1 )) T . When R is a unitary representation, this is the same as the 
conjugate representation R(g)* , where here * denotes the entrywise complex conjugate. One can readily verify 
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that the dual and conjugate representations are indeed representations and that Hom(Vi, V2) = V{ (8) V2. 

Irreducible representations: Generically the unitary operators of a representation may be specified (and 
manipulated on a quantum computer) in an arbitrary orthonormal basis. The added structure of being a 
representation, however, implies that there are particular bases which are more fundamental to expressing the 
action of the group. We say a representation (R, V) is irreducible (and call it an irreducible representaiton, or 
irrep) if the only subspaces of V which are invariant under R arc the empty subspace {0} and the entire space 
V. For finite groups, any finite-dimensional complex representation is reducible; meaning it is decomposable 
into a direct sum of irreps. For Lie groups, we need additional conditions, such as demanding that the 
representation R(p) be rational; i.e. its matrix elements are polynomial functions of the matrix elements <?y 
and (detg) -1 . We say a representation of a Lie group is polynomial if its matrix elements are polynomial 
functions only of the gij . 
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Isotypic decomposition: Let Q be a complete set of inequivalent irreps of Q. Then for any reducible 
representation (R, V) there is a basis under which the action of R(g) can be expressed as 

= ®©^) = 0r A ( 5 ) ® I„ A (1) 

AGS J =1 AGS 

where Ae 5 labels an irrcp (r A , V\) and n A is the multiplicity of the irrep A in the representation V. Here 
we use = to indicate that there exists a unitary change of basis relating the left-hand size to the right-hand 
side. 1 Under this change of basis we obtain a similar decomposition of the representation space V (known 
as the isotypic decomposition): 

V%7 A 0C\ (2) 

AGS 

Thus while gencrically we may be given a representation in some arbitrary basis, the structure of being a 
representation picks out a particular basis under which the action of the representation is not just block 
diagonal but also maximally block diagonal: a direct sum of irreps. 

Moreover, the multiplicity space C" A in Eq. (2) has the structure of Hom(Vx, V) Q . This means that for 
any representation (R, V), Eq. (2) can be restated as 

vli($V x ®Hom(V x ,Vf- (3) 

AGS 

Since Q acts trivially on Hom(V\, V) Q , Eq. (1) remains the same. As with the other results in this chapter, 
a proof of Eq. (3) can be found in [35] , or other standard texts on representation theory. 

The value of Eq. (3) is that the unitary mapping from the right-hand side (RHS) to the left-hand side 
(LHS) has a simple explicit expression: it corresponds to the canonical map ip : A ® Hom(A, B) — > B given 
by f (a ® /) = /(a). Of course, this doesn't tell us how to describe Hom(V\, V) s , or how to specify an 
orthonormal basis for the space, but we will later find this form of the decomposition useful. 

B. The Schur Transform 

We now turn to the two representations relevant to the Schur transform. Recall that the symmetric 
group S n of degree n, is the group of all permutations of n objects. Then we have the following natural 
representation of the symmetric group on the space (C d )® n : 

P(s)|h) ® \i 2 ) ® • • • ® \i n ) = \i s -i(i)) ® |« s -i(2)) ® ■ ■ ■ ® N s -i(n)) (4) 

where s £ S n is a permutation and s(i) is the label describing the action of s on label i. For example, if we are 
considering <S 3 and the permutation we are considering is the transposition s = (12), then P(s)|ii, i 2 , 13) = 
1*27*1,^3)- (P, (C d )®") is the representation of the symmetric group which will be relevant to the Schur 
transform. Note that P obviously depends on n, but also has an implicit dependence on d. 

Now we turn to the representation of the unitary group. Let Ud denote the group of dx d unitary operators. 
Then there is a representation of Ud given by the n-fold product action as 

Q(U)\h) ® \i 2 ) ® • • • ® \i n ) = U\h) ® U\i 2 ) ® • • • ® U\i n ) (5) 

for any U e Ud- More compactly, we could write that Q(U) = U® n . (Q, (<C d )® n ) is the representation of 
the unitary group which will be relevant to the Schur transform. 



We only need to use = when relating representation spaces. In Eq. (1) and other similar isomorphisms, we instead explicitly 
specify the dependence of both sides on g £ Q. 
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Since both P(s) and Q(U) meet our above criteria for reducibility, they can be decomposed into a direct 
sum of irreps as in Eq. (1), 

P(s) e? 0I„„® Pa ( s ) 

a 

Q(U) i 0I m ,®q^([/) (6) 

where n Q (m^) is the multiplicity of the ath (/3th) irrep p Q (s) (q^([/)) in the representation P(s) (Q(C/)). 
At this point there is not necessarily any relation between the two different unitary transforms implementing 
the isomorphisms in Eq. (6). However, further structure in this decomposition follows from the fact that 
P(s) commutes with Q(U): P(s)Q(U) = Q(U)P(s). This implies, via Schur's Lemma, that the action of the 
irreps of P(s) must act on the multiplicity labels of the irreps Q(J7) and vice versa. Thus, the simultaneous 
action of P and Q on (C d )®" decomposes as 

Q(U)P(s) \ maS ® qpiU) ® Pa (s) (7) 

a p 

where m a ^ can be thought of as the multiplicity of the irrep Pa(s) ® Q/3(^0 of the group x S n . 

Not only do P and Q commute, but the algebras they generate (i.e. A := P(C[5„]) = Span{P(s) : s £ S n } 
and B := Q(C[W<j]) = Span{Q([7) : U £ Ltd}) centralize each other[35], meaning that B is the set of operators 
in End((C d )®") commuting with A and vice versa, A is the set of operators in End((C d )®") commuting 
with B. This means that the multiplicities m a ^ are either zero or one, and that each a and (3 appears at 
most once. Thus Eq. (7) can be further simplified to 

Q(t/)P(s) Ud ^ n 0qA((7) ® PA («) (8) 

A 

where A runs over some unspecified set. 

Finally, Schur duality (or Schur-Weyl duality) [35, 37] provides a simple characterization of the range of 
A in Eq. (8) and shows how the decompositions are related for different values of n and d. To define Schur 
duality, we will first need to specify the irreps of <S„ and hid- 

Let Td, n = {A = (Ai, A2, • • • , A<j)|Ai > A2 > • • • > > and Xi — n} denote partitions of n into 

< d parts. We consider two partitions (Ai, . . . , Xd) and (Ai, . . . , A<j, 0, . . . , 0) equivalent if they differ only by 
trailing zeroes; according to this principle, l n :— l n , n contains all the partitions of n. Partitions label irreps 
of S n and Ud as follows: if we let d vary, then Id,n labels irreps of S ni and if we let n vary, then Td.n labels 
polynomial irreps of Ud- Call these (p\,V\) and (q^, Qf) respectively, for A £ ld, n - We need the superscript 
d because the same partition A can label different irreps for different Ud', on the other hand the <S„-irrep V\ 
is uniquely labeled by A since n = J2i 

For the case of n qudits, Schur duality states that there exists a basis (which we label |A)|g.\)|PA)sch ano - 
call the Schur basis) which simultaneously decomposes the action of P(s) and Q(U) into irreps: 

Q(t/)|A)| gA )K) Sch = |A)(q^)kA»bA> Sch 

P( S )|A)| 9A )K) Sch = |A)|«A>(PA(*)bA)) Sc h (9) 
and that the common representation space (<C d )® n decomposes as 

( C d)®n U % S " Qd p A . (10) 

Aeid,„ 

The Schur basis can be expressed as superpositions over the standard computational basis states 
\h,i2, •■•,««) as 

|A,9A,PA)sch= E [U Sch ] A ;^VJ* 1 z 2 ...z„), (11) 

ii,i 2 ,...,i n 

where Us c h is the unitary transformation implementing the isomorphism in Eq. (10). Thus, for any U £Ud 
and any s £ S n , 

U Sch Q([/)P(s)l4 h = ]T |A)(A|®q^(f/)®p A ( S ). (12) 
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If we now think of Ug c h as a quantum circuit, it will map the Schur basis state |A, <Za,Pa)s c 1i t° the computa- 
tional basis state |A, q x ,p x ) with A, q x , and p x expressed as bit strings. The dimensions of the irreps p x and 
vary with A, so we will need to pad the \q\,p\) registers when they are expressed as bit strings. We will 
label the padded basis as |A)|g)|p), explicitly dropping the A dependence. Later in the paper we will show 
how to do this padding efficiently with only a logarithmic spatial overhead. We will refer to the transform 
from the computational basis \ii,i2, ■ ■ ■ ,in) to the basis of three strings |A)|g)|p) as the Schur transform. 
The Schur transform is shown schematically in Fig. 1. Notice that just as the standard computational basis 
\i) is arbitrary up to a unitary transform, the bases for Qf and V\ are also both arbitrary up to a unitary 
transform, though we will later choose particular bases for Qf and V x . 

Example of the Schur transform — Let d = 2. Then for n = 2 there are two valid partitions, A x = 2, A 2 = 
and Xi = A 2 = 1. Here the Schur transform corresponds to the change of basis from the standard basis to 
the singlet and triplet basis: |A = (1,1), q x = 0, Px = 0) Sc h = ^ (|01) - |10)), |A = {2,0), q x = +l,p x = 

0)sch = |00), |A = (2,0),<za = 0,p A = 0) S ch = ^(|01) + |10», and |A = (2,0), q x = -l,p x = 0) Sch = |11). 
Abstractly, then, the Schur transform then corresponds to a transformation 



Usch 



|A = (1,1), (JA = 0,Px = 0) Sch 
\\ = (2,0),q x = +l,p x = 0) Sch 
| A = (2,0), q x = 0,pa = 0) Sch 

|A = (2,0), 9 A = -l,PA = 0)sch 



|00> |01> 1 10) |11) 



" 


1 


1 


0" 




V2 


1 














1 


1 


















1 



(13) 



It is easy to verify that the A = (1,1) subspace transforms as a one dimensional irrep of Ui and as the 
alternating sign irrep of 52 and that the A = (2, 0) subspace transforms as a three dimensional irrep of IA2 
and as the trivial irrep of S 2 - Notice that the labeling scheme for the standard computational basis uses 2 
qubits while the labeling scheme for the Schur basis uses more qubits (one such labeling assigns one qubit 
to I A), none to \p) and two qubits to \q)). Thus we see how padding will be necessary to directly implement 
the Schur transform. 

To see a more complicated example of the Schur basis, let d = 2 and n — 3. There are again two valid 
partitions, A = (3,0) and A = (2, 1). The first of these partitions labels to the trivial irrep of S3 and a 4 
dimensional irrep of Uz ■ The corresponding Schur basis vectors can be expressed as 

|A = (3, 0), q x = +3/2, PA = 0) Sch = |000> 



|A = (3, 0),q x = +l/2,p x = 0) Sch = -j= (|001) + |010) + |100>) 

|A = (3, 0),q x = -l/2,p x = 0) S ch = ^= (|011) + |101) + |110» 
|A = (3,0),g A - -3/2, p A - 0) Sch = |111). 



(14) 



The second of these partitions labels a two dimensional irrep of S3 and a two dimensional irrep of U2- Its 
Schur basis states can be expressed as 



A = 


(2,1),<ZA 


= +l/2,p A = 


0) S ch = 


7! (|100) - 


- loio)) 


A = 


(2,1),9A 


= -l/2,p A = 


0) S ch = 


7! (|101> - 


-|011)) 


A = 


(2,1),9a 


= +l/2,p A = 


l)sch = 


Vfiooi)- 


|010) + 1 100) 
V6 


A = 


(2,1),9A 


= -l/2,p A = 


l)sch = 


Vfliio)- 


|101) + |011) 
V6 



(15) 



We can easily verify that Eqns. (14) and (15) indeed transform under Ui and S3 the way we expect; not so 
easy however is generalizing this basis to any n and d, let alone coming up with a natural circuit relating 
this basis to the computational basis. However, note that p x determines whether the first two qubits are 
in a singlet or a triplet state. This gives a hint of a recursive structure that we will exploit in Sec. Ill to 
describe Schur bases for any choice of n and d, and in Sec. IV to construct an efficient recursive algorithm 
for the Schur transform. 



7 




FIG. 1: The Schur transform. Notice how the direct sum over A in Eq. (10) becomes a tensor product between the 
| A) register and the \q) and \p) registers. Since the number of qubits needed for \q) and \p) vary with A, we need 
slightly more spatial resources, which are here denoted by the ancilla input |0). 



C. Constructing Qf and V\ using Schur duality 

So far we have said little about the form of Qf and V\, other than that they are indexed by partitions. It 
turns out that Schur duality gives a straightforward description of the irreps of hid and S n . We will not use 
this explicit description to construct the Schur transform, but it is still helpful for understanding the irreps 
and V\. As with the rest of this section, proofs and further details can be found in [35]. 

We begin by expressing A € ld, n as a Young diagram in which there are up to d rows with Aj boxes in row 
i. For example, to the partition (4, 3, 1, 1) we associate the diagram 



U . (16) 

Now wc define a Young tableau T of shape A to be a way of filling the n boxes of A with the integers 1, . . . , n, 
using each number once and so that integers increase from left to right and from top to bottom. For example, 
one valid Young tableau with shape (4, 3, 1, 1) is 



1 


4 


G 


7| 


2 


■5 


8 




3 
9 









For any Young tableau T, define Row(T) to be set of permutations obtained by permuting the integers 
within each row of T; similarly define Col(T) to be the permutations that leave each integer in the same 
column of T. Now wc define the Young symmetrizer U\ : t to be an operator acting on (C d )® n as follows: 

n A:T := ^ ( £ sgn(c)P( c) j ( £ P(r) ) . (17) 

' \c£Col(T) / \r£Row(T) / 

It can be shown that the Young symmetrizer IIa:t is a projection operator whose support is a subspace 
isomorphic to Q d x . In particular U Sch IT A:T U^ ch = |A)(A|®|t/(T))(y(T)|(8)I Q d for some unit vector \y{T)) E V\. 
Moreover, these vectors \y(T)) form a basis known as Young's natural basis, though the \y(T)) are not 
orthogonal, so we will usually not work with them in quantum circuits. 

Using Young symmetrizers, we can now explore some more general examples of Qf and V\. If A = (n), 
then the only valid tableau is 

|l|2| ...[nl 

The corresponding <S„-irrep V( n ) is trivial and the %-irrep is given by the action of Q on the totally symmetric 
subspace of (C d )® n , i.e. {\v) : P(s)\v) = |u)Vs e S n }. On the other hand, if A = (1™), meaning (1, 1, ... , 1) 
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(n times), then the only valid tableau is 

T 

2 




The <S„-irrep V{\^ is still one-dimensional, but now corresponds to the sign irrep of <S„, mapping s to sgn(s). 
The %-irrep Q^n) is equivalent to the totally antisymmetric subspace of (C d )®", i.e. {\v) : P(s)\v) = 
sgn(s) |u)Vs € 5„}. Note that if d > n, then this subspace is zero-dimensional, corresponding to the restriction 
that irreps of Ud are indexed only by partitions with < d rows. 

Other explicit examples of Ud and S n irreps are presented from a particle physics perspective in [38] . We 
also give more examples in Sec. IIIB, when we introduce explicit bases for Qf and V\. 

D. Applications of the Schur Transform 

The Schur transform is useful in a surprisingly large number of quantum information protocols. Here we, 
review these applications, with particular attention to the use of the Schur transform circuit in each protocol. 
We emphasize again that our construction of the Schur transform simultaneously makes all of these tasks 
computationally efficient. 

1. Spectrum and state estimation 

Suppose we are given many copies of an unknown mixed quantum state, p® n . An important task is to 
obtain an estimate for the spectrum of p from these n copies. An asymptotically good estimate (in the sense 
of large deviation rate) for the spectrum of p can be obtained by applying the Schur transform, measuring 
A and taking the spectrum estimate to be (Ai/n, . . . , Xd/n)[19, 21]. Thus an efficient implementation of the 
Schur transform will efficiently implement the spectrum estimating protocol (note that it is efficient in d, not 
in log(<i)). Estimating p reduces to measuring |A) and \q), but optimal estimators have only been explicitly 
constructed for the case of d = 2[20]. Further, optimal quantum hypothesis testing can be obtained by a 
similar protocol [2 5]. 

2. Universal distortion-free entanglement concentration 

Let \iP)ab be a bipartite partially entangled state shared between two parties, A and B. Suppose we are 
given many copies of \iP)ab and we want to transform these states into copies of a maximally entangled 
state using only local operations and classical communication. Further, suppose that we wish this protocol 
to work when neither A nor B know the state \iP}ab- Such a scheme is called a universal (meaning it works 
with unknown states \ip) ab) entanglement concentration protocol, as opposed to the original entanglement 
concentration protocol described by Bennett et.al.[b\. Further we also would like the scheme to produce 
perfect maximally entangled states, i.e. to be distortion free. Universal distortion-free entanglement con- 
centration can be pcrformcd[22] by both parties performing Schur transforms on their n halves of \iP)ab, 
measuring their |A), discarding \q) and retaining \p). The two parties will now share a maximally entangled 
state of varying dimension depending on what A was measured. This dimension asymptotes to 2 nH , where 
H is the entropy of one of the parties' reduced mixed states. 

3. Universal Compression with Optimal Overflow Exponent 

Measuring |A) weakly so as to cause little disturbance, together with appropriate relabeling, comprises a 
universal compression algorithm with optimal overflow exponent (rate of decrease of the probability that the 
algorithm will output a state that is much too large) [23, 24]. 
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4- Encoding and decoding into decoherence-free subsystems 



Further applications of the Schur transform include encoding into decoherence-free subsystems [26-29]. 
Decoherence-free subsystems are subspaces of a system's Hilbert space which are immune to decoherence 
due to a symmetry of the system-environment interaction. For the case where the environment couples 
identically to all systems, information can be protected from decoherence by encoding into the \p\) basis. 
We can use the inverse Schur transform (which, as a circuit can be implemented by reversing the order 
of all gate elements and replacing them with their inverses) to perform this encoding: simply feed in the 
appropriate |A) with the state to be encoded into the \p) register and any state into the \q) register into the 
inverse Schur transform. Decoding can similarly be performed using the Schur transform. 



An application of the concepts of decohcrcncc-frcc subsystems comes about when two parties wish to 
communicate (in either a classical or quantum manner) when the parties do not share a reference frame. 
The effect of not sharing a reference frame is the same as the effect of collective decoherence (the same 
random unitary rotation has been applied to each subsystem). Thus encoding information into the \p) 
register will allow this information to be communicated in spite of the fact that the two parties do not share 
a reference frame[30]. Just as with decoherence-free subsystems, this encoding and decoding can be done 
with the Schur transform. 



In the last section, we defined the Schur transform in a way that left the basis almost completely arbitrary. 
To construct a quantum circuit for the Schur transform, we will need to explicitly specify the Schur basis. 
Since we want the Schur basis to be of the form \X,q,p), our task reduces to specifying orthonormal bases 
for and V\. We will call these bases Qf and Pa, respectively. 

We will choose Qf and Pa to both be a type of basis known as a subgroup- adapted basis. In Sec. Ill A 
we describe the general theory of subgroup-adapted bases, and in Sec. IIIB, we will describe subgroup- 
adapted bases for and V\. As we will later see, the properties of these bases are intimately related to the 
structure of the algorithms that work with them. In this section, we will show how the bases can be stored 
on a quantum computer with a small amount of padding, and in the following sections we will show how 
the subgroup-adapted bases described here enable efficient implementations of Clebsch-Gordan and Schur 
duality transforms. 



Here we review the basic idea of a subgroup adapted basis. We assume that all groups we talk about are 
finite or compact Lie groups. Suppose (r, V) is an irrep of a group Q and TL is a proper subgroup of Q. We 
will construct a basis for V via the representations of TL. 

Begin by restricting the input of r to TL to obtain a representation of H, which we call (r|^, V[ H ). Note 
that unlike V, V[ H may be reducible. In fact, if we let (r' Q , V^) denote the irrcps of TL, then V[ H will 
decompose under the action of TL as 



5. Communication without a shared reference frame 



III. SUBGROUP ADAPTED BASES AND THE SCHUR BASIS 



A. Subgroup Adapted Bases 



(18) 



or equivalently, r\u decomposes as 



v{h)=vi H {h) = ^\ na ®v' a {h) 



(19) 



a6H 



where TL runs over a complete set of inequivalcnt irreps of TL and n a is the branching multiplicity of the 
irrep labeled by a. Note that since r is a unitary representation, the subspaces corresponding to different 
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irreps of TL are orthogonal. Thus, the problem of finding an orthonormal basis for V now reduces to the 
problem of (1) finding an orthonormal basis for each irrep of 7i, V a and (2) finding orthonormal bases for the 
multiplicity spaces C" a . The case when all the n a are either or 1 is known as multiplicity-free branching. 
When this occurs, we only need to determine which irreps occur in the decomposition of V, and find bases 
for them. 

Now consider a group Q along with a tower of subgroups Q = Gi D G2 D ■ ■ ■ D Gk-i 3 Gk = {e} where {e} 
is the trivial subgroup consisting of only the identity element. For each Gi, denote its irreps by V£, for a € Gi- 
Any irrep oiG — Gi decomposes under restriction to G2 into C/2-irreps: say that V% 2 appears n aiia2 times. 
We can then look at these irreps of Gi , consider their restriction to G3 and decompose them into different 
irreps of G3 ■ Carrying on in such a manner down this tower of subgroups will yield a labeling for subspaces 
corresponding to each of these restrictions. Moreover, if we choose orthonormal bases for the multiplicity 
spaces, this will induce an orthonormal basis for G ■ This basis is known as a subgroup- adapted basis and basis 
vectors have the form \ai2, rri2, 0:3, ms, . . . , a n , m n ), where \rrii) is a basis vector for the (n Qi ljQ , i -dimcnsional) 
multiplicity space of V^. in V*t} ■ 

If the branching for each Gi+i C Gi is multiplicity-free, then we say that the tower of subgroups is canonical. 
In this case, the subgroup adapted basis takes the particularly simple form of \ct2, ■ ■ ■ , a n ), where each a, e Gi 
and a i+ i appears in the decomposition of V ai [ g . Often we include the original irrep label a = ct\ as well: 
|ai,Q!2, • • • , Qffe). This means that there exists a basis whose vectors are completely determined (up to an 
arbitrary choice of phase) by which irreps of G\, . . . , Gk they transform according to. Notice that a basis for 
the irrep V a does not consist of all possible irrep labels a,, but instead only those which can appear under 
the restriction which defines the basis. 

The simple recursive structure of subgroup adapted bases makes them well-suited to performing explicit 
computations. Thus, for example, subgroup adapted bases play a major role in efficient quantum circuits 
for the Fourier transform over many nonabelian groups [33]. 



B. Explicit orthonormal bases for Q d and V\ 



In this section we describe canonical towers of subgroups for Ud and S n , which give rise to subgroup- 
adapted bases for the irreps Qf and V\. These bases go by many names: for Ud (and other Lie groups) the 
basis is called the Gel'fand-Zetlin basis (following [31]) and we denote it by Qf, while for S n it is called the 
Young- Yamanouchi basis, or sometimes Young's orthogonal basis (see [32] for a good review of its properties) 
and is denoted P\. The constructions and corresponding branching rules are quite simple, but for proofs we 
again refer the reader to [35]. 

The Gel'fand-Zetlin basis for Qf — For Ud, it turns out that the chain of subgroups {1} = Uo C U\ C 
. . . C Ud-i C Ud is a canonical tower. For c < d, the subgroup U c is embedded in Ud by U c := {u e Ud ■ 
u\i) = \i) for i = c + 1, . . . , d}. In other words, it corresponds to matrices of the form 











(20) 



where u is a c x c unitary matrix. 

Since the branching from Ud to Ud-i is multiplicity- free, we obtain a subgroup- adapted basis Qf, which 
is known as the Gel'fand-Zetlin (GZ) basis. Our only free choice in a GZ basis is the initial choice of basis 
|1), . . . , \d) for C d which determines the canonical tower of subgroups U\ C . . . C Ud- Once we have chosen 
this basis, specifying reduces to knowing which irreps Q^ 1 appear in the decomposition of Qxlu d l - 
Recall that the irreps of Ud are labeled by elements of Id ,n with n arbitrary. This set can be denoted by 
Z d ++ := U n Id,n = {A e 7L d : Ai > . . . > \ d > 0}. For fi e , A e we say that n interlaces A and write 

[i~<\ whenever Ai > \i\ > A 2 . . . > Xd-i > Md-i > A<j. In terms of Young diagrams, this means that \i is a 
valid partition (i.e. a nonnegative, nonincreasing sequence) obtained from removing zero or one boxes from 
each column of A. For example, if A = (4,3, 1, 1) (as in Eq. (16)), then ^A can be obtained by removing 
any subset of the marked boxes below, although if the box marked * on the second line is removed, then the 
other marked box on that line must also be removed. 









X 




* 


X 





(21) 
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Thus a basis vector in Qf corresponds to a sequence of partitions q = (qd 



A. 



, qi ) such that 



9i^92^ 



d 



^qd and qj € Z 3 ++ for j = 1 , . . . , d. Again using A 
5 (any d > 4 is possible), we might have the sequence 



(4,3,1,1) as an example, and choosing 



>- 



95 



94 



93 



92 



91 



(22) 



Observe that it is possible in some steps not to remove any boxes, as long as qj has no more than j rows. 

In order to work with the Gel'fand-Zetlin basis vectors on a quantum computer, we will need an efficient 
method to write them down. Typically, we think of d as constant and express our resource use in terms of n. 
Then an element of ld,n can be expressed with d\og(n + 1) bits, since it consists of d integers between and 
n. (This is a crude upper bound on \Id,n\ = ("^-T 1 )' ^ u t ^ or cons tant d it is good enough for our purposes.) 
A Gel'fand-Zetlin basis vector then requires no more than d 2 log(n + 1) bits, since it can be expressed as d 
partitions of integers no greater than n into < d parts. Here, we assume that all partitions have arisen from 
a decomposition of (C d )® n , so that no Young diagram has more than n boxes. Unless otherwise specified, 
our algorithms will use this encoding of the GZ basis vectors. 

It is also possible to express GZ basis vectors in a more visually appealing way by writing numbers in the 
boxes of a Young diagram. If 9i^ . . . ^9<j is a chain of partitions, then we write the number j in each box 
contained in qj but not qj_i (with q n = (0)). For example, the sequence in Eq. (22) would be denoted 



1 


1 


2 


5 


2 


3 


3 




3 








5 









(23) 

Equivalently, any method of filling a Young diagram with numbers from 1, . . . , d corresponds to a valid 
chain of irreps as long as the numbers are nondecreasing from left to right and are strictly increasing from 
top to bottom. The resulting diagram is known as a semi-standard Young tableau and gives another way 
of encoding a GZ basis vector; this time using n log d bits. (It turns out the actual dimension of Qf is 



IT 

1 lr 



and later in this section we will give an algorithm for efficiently 
qubits. However, this is not necessary for most 



rii<i<j<d( A * - a j +3 I 
encoding a GZ basis vector in the optimal [log dim Qf 
applications.) 

Example: irreps of U 2 — To ground the above discussion in an example more familiar to physicists, wc 
show how the GZ basis for U 2 irreps corresponds to states of definite angular momentum along one axis. An 
irrep of U2 is labeled by two integers (Ai, A2) such that Ai + A2 = n and Ai > A2 > 0. A GZ basis vector for 
Q\ has A 2 + to l's in the first row, followed by Ai — (A 2 + to) 2's in the first row and A2 2's in the second row, 
where m ranges from to Ai — A 2 . This arrangement is necessary to satisfy the constraint that numbers are 
strictly increasing from top to bottom and are nondecreasing from left to right. Since the GZ basis vectors 
are completely specified by to, we can label the vector |(Ai, A 2 ); (A 2 
A = (9,4) and to — 2 would look like 



m )) <= Q\ simply by \m). For example, 



1 


1 


1 


1 


1 1 1 1 2 | 2 | 2 


2 


2 


2 


2 





(24) 

Now observe that dim Q\ = Ai — A 2 + 1, a fact which is consistent with having angular momentum 
J = (Ai — A 2 )/2. We claim that to corresponds to the Z component of angular momentum (specifically, the Z 
component of angular momentum is to— J = m— (Ai — A 2 )/2). To see this, first note that U\ acts on a GZ basis 
vector I to) according to the representation x — » x X2+m , for ieWi; equivalently q| (( g J)) \m) = x X2+m \m). 

Since q|(y/ 2 )|m) = y n \m) = y Xl+X2 \m), we can find the action of e i6a * = °) ^ ) on \m). We do 

this by combining the above arguments to find that q|(e ie<T ^) \m) = e 2ie ( A2+m )e _ie(Al+A2) |m) = e 2ie( - m ~ J ^\m). 
Thus we obtain the desired action of a Z rotation on a particle with total angular momentum J and Z- 
componcnt of angular momentum m. 

Example: The defining irrep of Ud — The simplest nontrivial irrep of Ud is its action on C d . This 
corresponds to the partition (1), so we say that (q^-,, Q^)) is the defining irrep of Ud with = C d 
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U. Let |1), . . . , \d) be an orthonormal basis for C d corresponding to the canonical tower 
C • • • C hid- It turns out that this is already a GZ basis. To see this, note that 



Qd) 1 • This is because \d) generates Qfn) 1 ' a trivial irrep of Ud-i', and |1), . . . , \d — 1) 



and q^(t/) 

of subgroups U\ 

d d-i 
2(i)lw d _! — 2(0) 

generate Q^ 1 , a defining irrep of Ud-i- Another way to say this is that \j) is acted on according to the 
trivial irrep of U\, . . . ,Uj-i and according to the defining irrep of Uj, . . . ,Ud- Thus \j) corresponds to the 
chain of partitions {(Op 1 , (l)" - - 7-1-1 }. We will return to this example many times in the rest of the paper. 

The Young- Yamanouchi basis for V\ — The situation for S n is quite similar. Our chain of subgroups is 
{e} = S\ C <S>2 C . . . C S n , where for m < n we define S m C S n to be the permutations in S n which leave the 
last n — m elements fixed. For example, if n = 3, then S3 = {e, (12), (23), (13), (123), (321)}, <S 2 = {e, (12)}, 
and Si — {e}. Recall that the irreps of S n can be labeled by X n = I n , n - the partitions of n into < n parts. 

Again, the branching from S n to S n -i is multiplicity-free, so to determine an orthonormal basis P\ for 
the space V\ we need only know which irreps occur in the decomposition of Vy^s . It turns out that the 
branching rule is given by finding all ways to remove one box from A while leaving a valid partition. Denote 



the set of such partitions A — □. Formally, A — □ := T n n {A — ej : j = 1, 



in Z™ with a one in the j 



th 



1}, where ej is the unit vector 



position and zeroes elsewhere. Thus, the general branching rule is 



(25) 



For example, if A = (3, 2, 1), we might have the chain of partitions: 



n = 6 



n = 5 



n = 4 



n = 3 



n = 2 



□ 



n = 1 



(26) 



Again, we can concisely label this chain by writing the number j in the box that is removed when restricting 
1. The above example would then be 



from Sj to Sj 



1 


3 


6| 


2 


4 




5 







(27) 

Note that the valid methods of filling a Young diagram are slightly different than for the Ud case. Now we 
use each integer in 1 , . . . , n exactly once such that the numbers are increasing from left to right and from 
top to bottom. The resulting tableau is called a standard Young tableau. (The same filling scheme appeared 
in the description of Young's natural representation in Sec. II C, but the resulting basis states are of course 
quite different.) 

This gives rise to a straightforward, but inefficient, method of writing an element of P\ using logn! bits. 
However, for applications such as data compression[23, 24] we will need an encoding which gives us closer to 
the optimal log Pa bits. First we note an exact (and efficiently computable) expression for |Pa| = dim "Pa: 



dim Pa 



Ai +d- l!A 2 + d-2!< 



n (** 

l<i<j<d 



(28) 



Now we would like to efficiently and reversibly map an element of P\ (thought of as a chain of partitions 
V = (Pn = A, . . . ,pi = (1)) e P\, with pj e Pj+i— d) to an integer in [\P\\] := {1, . . . ,\P\\}. We will construct 
this bijection f n : P\ —* [|P\|] by defining an ordering on Pa and setting f n (p) ■= \{p' € Pa : p' < p}\- 
First fix an arbitrary, but easily computable, (total) ordering on partitions in l n for each n; for example, 
lexicographical order. This induces an ordering on Pa if we rank a basis vector p e P\ first according to 
Pn-i, using the order on partitions we have chosen, then according to p n -2 and so on. We skip p n , since it 
is always equal to A. In other words, for p,p' e Pa, p > p' if p n -i > p' n -i or Pn-i = p' n -i an d Pn-2 > p' n -2 
or p n -! = p'„_ 1; p n -2 = p' n -2 and Pn-3 > p' n -3i and s0 on - Tnus fn-P\—> [|P\|] can be easily verified to 
be 



fn{p) = fn{Pl, ■ ■ ■ ,Pn) ■= 1 + X] dim P n 



k=2 uEp k — □ 
H<Pk-l 



(29) 



Thus f n is an injective map from P\ to [\P\\ 
gives an efficient way to calculate each \P\\ 



. Moreover, since there are 0(n 2 ) terms in Eq. (29) and Eq. (28) 
this mapping can be performed in time polynomial in n. 
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IV. THE CLEBSCH-GORDAN TRANSFORM AND EFFICIENT CIRCUITS FOR THE SCHUR 

TRANSFORM 

In this section, we describe an efficient circuit for the Schur transform Us c h- To do so, we first describe 
the Clebsch-Gordan transform, which decomposes a Kronecker product of %-irreps Qf <g> Qf into a direct 
sum of other %-irreps. We defer the algorithm for the Clebsch-Gordan transform to Sec. V, but give a 
description of its properties in Sec. IV A. Then in Sec. IV B, we show how to construct the Schur transform 
by cascading a series of Clebsch-Gordan transforms and performing reversible classical manipulations of Pa 
and Q d . 



The Clebsch-Gordan Series and Transform 



Suppose we have two irreps of Ud given by the partitions \i and v. Qf and Qf. The tensor product of 
these irreps Qf ® Qf (with representation matrices q^(f/) ® Q.f{U) for U G Ud) is a new representation of 
Ud- This new representation will generally be reducible into irreps of Ud] following Eq. (3) we have 



Ql®Qi= Q\ ® Hom(Q^, Qf <g> Qf) 



d\U d 



This decomposition is referred to as the Clebsch-Gordan series. Setting 
we obtain the corresponding decomposition of the representation matrices as 



q^) 



(30) 



dimHom(Q^, Qf <g> Qf) Ud , 



(31) 



Ae2 



The unitary matrix which maps the LHS of Eq. (31) to the RHS is known as the Clebsch-Gordan transform 
and we denote it Uqq. It maps vectors of the form € Qf® Qf to superpositions of vectors of the 

form \\)\q x )\a), where A e Z d ++ , \q\) € Qf and a e Hom(Q d , Qf ® Qf) Ud . 

The multiplicity space Hom(Q^, Qf <g> Qf) Ud plays a crucial role in the CG transform. In particular, the 
inverse CG transform (Ucg) 1S gi yen simply by 



(U^\ qx )\a)=a\q x ). 



(32) 



Note that on the LHS, we interpret \a) as a vector in the multiplicity space Hom(Q^, Qf <g> Qf) Ud , and on 
the RHS we treat a as an operator. These are normalized such that \a) is a unit vector if and only if a is 
an isometry. 

We now specialize to the case of tensoring in the defining irrep Ql,, for which the CG transform is partic- 
ularly simple. Recall that for A e and 1 < j < d we have A + = (Ai, . . . , Aj_i, Xj + 1, Aj+i, . . . , A<j). 
This is not always a valid partition, i.e. if A' = A + ej, the condition X'j_ 1 > X'j might not hold. Recall 
that the valid partitions (of any integer) are given by the set Z+ + . Then the Clebsch-Gordan series we are 
interested in is given by 



Qt®Q d 



(i) 





3 = 1, -a 



Qd 



X+ej 



(33) 



This is the "add a single box" prescription for tensoring in a defining representation of Ud- we add a single 
box to a Young diagram and if the new Young diagram is a valid Young diagram (i.e. corresponds to a valid 
partition), then this irrep appears in the Clebsch-Gordan series. For example if A = (3, 2, 1) then 



or in Young diagram form 



Q 3 



(3,2,1) 



Q 3 



U 3 



(1) 



2(4,2,1) 



© 2(3,3,1) © 2(3,2,2) 



(34) 



u 3 



(35) 
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Note that if we had d > 3, then the partition (3, 2, 1, 1) would also appear. 

We now seek to define the CG transform as a quantum circuit. We specialize to the case where one of the 
input irrcps is the defining irrep, but allow the other irrep to be specified by a quantum input. The resulting 
CG transform is defined as: 



u CG = £ 



|A)(A|®U; 



A,(l) 
CG • 



(36) 



X + ej € % d ++ , j 



\q) € Qf and i <G [d]. The output is a 



€ [d] and \q') G Qf,. Equivalently, we 



This takes as input a state of the form |A)|g)|z), for A e 
superposition over vectors |A)|A')|<7'), where A' 

could output |A)|j)|g') or |j)|A')|g'), since (A, A'), (X,j) and (A',j) are all trivially related via reversible 
classical circuits. 

To better understand the input space of Ucg ; we introduce the model representation Qf := © AeZ d Qf, 

with corresponding matrix q_f(U) = ^2 X |A)(A| <g> q x (U). The model representation (also sometimes called 
the Schwinger representation) is infinite dimensional and contains each irrep once. 2 Its basis vectors are of 
the form |A, q) for A e and \q) £ Qf. Since Qf is infinite-dimensional, we cannot store it on a quantum 
computer and in this paper work only with representations Qf with |A| < n; nevertheless Qf is a useful 
abstraction. 

Thus Ucg decomposes Qf <8> Q d ^ into irreps. There are two important things to notice about this version 
of the CG transform. First is that it operates simultaneously on different input irreps. Second is that 
different input irreps must remain orthogonal, so in order to to maintain unitarity Ucg needs to keep the 
information of which irrep we started with. However, since A' = X + ej, this information requires only storing 
some j e [d]. Thus, U C g is a map from Qf <g> C d to Qf <E> C d , where the C d in the input is the defining 
representation and the C d in the output tracks which irrep we started with. 



|A> 











Ucg 















|A> 



|A'> 



I?) 



FIG. 2: Schematic of the Clebsch-Gordan transform. Equivalently, we could replace either the A output or the A' 
output with j. 



B. Constructing the Schur Transform from Clebsch-Gordan Transforms 

We now describe how to construct the Schur transform out of a series of Clebsch-Gordan transforms. 
Suppose we start with an input vector . . . , i n ) e (C d )® n , corresponding to the ^-representation (Q^n)®"- 

According to Schur duality (Eq. (10)), to perform the Schur transform it suffices to decompose (Q d 1 ^)® n into 

Wd-irreps. This is because Schur duality means that the multiplicity space of Qf must be isomorphic to V\. 
In other words, if we show that 

s n 

then we must have V' x = V\ when A € Td.n and V' x = {0} otherwise. 



2 By contrast, L 2 (Ud), which will we not use, contains with multiplicity dim 
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To perform the W^-irrep decomposition of Eq. (37), we simply combine each of . . . , \i n ) using the 
CG transform, one at a time. We start by inputting jA^ 1 - 1 ) = |(1)), \i±) and fa) into TJcg which outputs 
JA^ 1 -*) and a superposition of different values of |A^ 2 ^) and |cjr 2 ) . Here A*- 2 ) can be either (2,0) or (1,1) and 
\Q2) € Q^(2>- Continuing, we apply Ucg to I A^ 2 ^ ^ | j ^3) , and output a superposition of vectors of the form 
|A (2) )|A (3) )|q 3 ), with A (3) € 2" d , 3 and \q 3 ) e Qf (a) . Each time we are combining an arbitrary irrep A*- fc ^ and an 
associated basis vector \qk) € together with a vector from the defining irrep |ifc+i). This is repeated 

for k = 1, . . . , n — 1 and the resulting circuit is depicted in Fig. 3. 



I(l)>— I 

In) 
\h) - 



Ucg 



\u) 




l*n) 




Ucg — 



|A«> 
|A< 2 )) 
|A< 3 )) 



| A («- 2 )) 

I At"- 1 )) 
|A(»)) 

\ln) 



FIG. 3: Cascading Clebsch-Gordan transforms to produce the Schur transform. Not shown are any ancilla inputs to 
the Clebsch-Gordan transforms. The structure of inputs and outputs of the Clebsch-Gordan transforms are the same 
as in Fig. 2. 

Finally, we are left with a superposition of states of the form lA' 1 ), . . . , \^)\q,i) , where \q n ) <E Q^<„)j 
\(. k ) £ Xd.k and each \^ is obtained by adding a sing le box to A^- 1 ); i.e. A<» = A (fe-1 ) + e jk for some 
jk € [d]. If we define A = A*-™- 1 and \q) = \q n ), then we have the decomposition of Eq. (37) with V' x 
spanned by the vectors jA^ 1 ', . . . , A'™ -1 )) satisfying the constraints described above. But this is precisely 
the Young- Yamanouchi basis Pa that we have defined in Sec. III! Since the first k qudits transform under 
Ud according to Qiik) > Schur duality implies that they also transform under Sk according to P^W • Thus 
we set \p) = |AW, . . . , A^"- 1 )) (optionally compressing to [log \P\\~\ qubits using the techniques described in 
the last section) and obtain the desired |A)|g)[p). As a check on this result, note that each A^ fc ^ is invariant 
under Q(Ud) since U® n acts on the first k qubits simply as U® k . 

If we choose not to perform the poly(n) steps to optimally compress we could instead 

have our circuit output the equivalent . . . , j n -i), which requires only nlogd qubits and asymptotically 
no extra running time. 

We can now appreciate the similarity between the Ud CG "add a box" prescription and the iS„_i C S n 
branching rule of "remove a box." Schur duality implies that the representations Qf, that are obtained by 
decomposing Qf ® Qk-. are the same as the 5 n -irreps Vx> that include V\ when restricted to S n -\. 

Define Tcg(", d, e) to be the time complexity (in terms of number of gates) of performing a single hid 
CG transform to accuracy e on Young diagrams with < n boxes. Then the total complexity for the Schur 
transform is n ■ (TcG(n,d,e/n) + 0(1)), possibly plus a poly(n) factor for compressing the V\ register to 
[log dim Pa] qubits (as is required for applications such as data compression and entanglement concentration, 
cf. Sec. II D). In the next section we will show that Tcg(?i, d, e) is poly (log n, d, log 1/e), but first we give a 
step-by-step description of the algorithm for the Schur transform. 

Algorithm: Schur transform (plus optional compression) 

Inputs: (1) Classical registers d and n. (2) An n qudit quantum register fa, . . . , i n ). 
Outputs: Quantum registers |A)|g)|p), with A £ Td.n, q £ Q\ and p e P\. 
Runtime: n ■ {Tccin, d, e/n) + O(l)) to achieve accuracy e. 

(Optionally plus poly(n) to compress the V\ register to [log dim Pa ] qubits.) 
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Procedure: 

1. Initialize |AW) := |(1)> and \ qi ) = 

2. For k = 1,. ..,n- 1: 

3. Apply U CG to \\^)\q k )\i k+1 ) to obtain output \jk)\X {k+1) )\q k +i) , where A< fc+1 ) = \ {k) + e 0k . 

4. Output | A) := |AW), |g) := |g„) and |p) := | j x , . . . ,j„_i). 

5. (Optionally use Eq. (29) to reversibly map |j l7 . . . , j„-i) to an integer p e [dimP^].) 

This algorithm will be made efficient in the next section, where we efficiently construct the CG transform 
for Ud, proving that Xcg(^, d, e) = poly(logn, d, log 1/e). 

V. EFFICIENT CIRCUITS FOR THE CLEBSCH-GORDAN TRANSFORM 

We now turn to the actual construction of the circuit for the Clebsch-Gordan transform described in 
Sec. IV A. To get a feel for the what will be necessary, we start by giving a circuit for the CG transform that 
is efficient when d is constant; i.e. it has complexity n oi - d \ which is poly(n) for any constant value of d. 

First recall that dim Qf < (n + l) d . Thus, controlled on A, we want to construct a unitary transform 
on a D-dimensional system for D = maxAex d „ dim Q d = poly(n). There are classical algorithms [39] to 
compute matrix elements of Ucg to an accuracy ei in time poly(D) poly log(l/ei). Once we have calcu- 
lated all the relevant matrix elements (of which there are only polynomially many), we can (again in time 
poly(-D) poly log(l/e)) decompose Ucg into D 2 polylog(Z?) elementary one and two-qubit operations [40-43]. 
These can in turn be approximated to accuracy e 2 by products of unitary operators from a fixed finite set 
(such as Clifford operators and a ir/8 rotation) with a further overhead of poly log(l/e 2 )[44, 45]. We can 
either assume the relevant classical computations (such as decomposing the D x D matrix into elementary 
gates) are performed coherently on a quantum computer, or as part of a polynomial-time classical Turing 
machine which outputs the quantum circuit. In any case, the total complexity is poly (n, log 1/e) if the 
desired final accuracy is e and d is held constant. 

The goal of this section is to reduce this running time to poly(n, d, log(l/e)); in fact, we will achieve 
circuits of size poly(<i,logn,log(l/e)). To do so, we will reduce the Ud CG transform to two components; 
first, a Ud-i CG transform, and second, a d x d unitary matrix whose entries can be computed classically 
in poly (d, log n, 1/e) steps. After computing all d 2 entries, the second component can then be implemented 
with poly(<i, log 1/e) gates according to the above arguments. 

This reduction from the Ud CG transform to the Ud-i CG transform is a special case of the Wigner- 
Eckart Theorem, which we review in Sec. V A. Then, following [39, 46], we use the Wigner-Eckart Theorem 
to give an efficient recursive construction for Ucg hi Sec. VB. Putting everything together, we obtain a 
quantum circuit for the Schur transform that is accurate to within e and runs in time n-poly(logn, d, log 1/e), 
optionally plus an additional poly(n) time to compress the \p) register. 

A. The Wigner-Eckart Theorem and Clebsch-Gordan transform 

In this section, we introduce the concept of an irreducible tensor operator, which we use to state and prove 
the Wigner-Eckart Theorem. Here we will find that the CG transform is a key part of the Wigner-Eckart 
Theorem, while in the next section we will turn this around and use the Wigner-Eckart Theorem to give a 
recursive decomposition of the CG transform. 

Suppose (ri, V\) and (r 2 , V 2 ) are representations of Ud- Recall that Hom(Vi, V 2 ) is a representation of Ud 
under the map T -> y 2 {U)Tt 1 {U)- 1 for T e Hom(Vi, V 2 ). If T = {T U T 2 , . . .} C Hom(Vi, V 2 ) is a basis for 
a Wd-invariant subspace of Hom(Vi, V 2 ), then we call T a tensor operator. Note that a tensor operator T is 
a collection of operators {Ti} indexed by i, just as a tensor (or vector) is a collection of scalars labeled by 
some index. For example, the Pauli matrices {a x , a y , a z } C Hom(C 2 ,C 2 ) comprise a tensor operator, since 
conjugation by U 2 preserves the subspace that they span. 

Since Hom(Fi, V 2 ) is a representation of Ud, it can be decomposed into irreps. If T is a basis for one of 
these irreps, then we call it an irreducible tensor operator. For example, the Pauli matrices mentioned above 
comprise an irreducible tensor operator, corresponding to the three-dimensional irrep Qf 2 )- Formally, we say 
that T v = {Tg v } qv< zQd C Hom(Fi, V 2 ) is an irreducible tensor operator (corresponding to the irrep Qf,) if for 
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all U €Ud we have 



r^T^iU)- 1 = J2 (M(U)\Q»)T} v . (38) 



Now assume that V\ and V2 are irreducible (say V\ = Q^ and V2 = Qf), since if they are not, we could 
always decompose Hom(Vi, V2) into a direct sum of homomorphisms from an irrep in V\ to an irrep in V<i. 
We can decompose Hom(<2^, Qf) into irreps using Eq. (3) and the identity Hom(A, B) = A* ® £> as follows: 

Hom(Q^Q^)^ Q^®Hom(Q^Hom(Q^Q^))"- 



e^®Hom(Q^(Q^)*®Q^) 

M (39) 
= d ® ((Qf,)* ® (Q^)* ® Qa) Wi! 

" d Q^®Hom(Q^® Q^) % 

Now consider a particular irreducible tensor operator T" C Hom(Q^, Q^) with components T£ where q u 
ranges over Qf. We can define a linear operator T : Qf ® <2^ — > by letting 

%>W~^W (40) 

for all <7 M G Qf,Qv € Qj5 and extending it to the rest of QjJ ® by linearity. By construction, T G 
Hom(Q^ ® Qy, Q^), but we claim that in addition T is invariant under the action of Ud\ i.e. that it lies in 
Hom(Q^ ® Qf, Qf) Ud . To see this, apply Eqns. (38) and (40) to show that for any U e U d , e and 
q v ^Qf, we have 

q^(f/)f [q^^-^q^C/)" 1 ]!^)!^) = £ (^Iq^)"^^^^, q^I/)" 1 ^) 

= Yl (Mmo'uHMiur^T^) (41) 

= T v q \q,)=f\q,)\q v ). 

Now, fix an orthonormal basis for Hom(Q^ ® Qf, Qf) Ud and call it M^ v . Then we can expand T in this 
basis as 

f= f °- a > (4 2 ) 



where the f a are scalars. Thus 



(9A|T£|g„>= X! ^(«A|a|^,^). (43) 



This last expression (qa |ct|9/j., 5^) bears a striking resemblance to the CG transform. Indeed, note that the 
multiplicity space Hom(Q^, Qf®Qf,) Ud from Eq. (30) is the dual of Hom(Q^<g>Q^, Qf) Ud (which contains a), 
meaning that we can map between the two by taking the transpose. In fact, taking the conjugate transpose 
of Eq. (32) gives (q x \a = (q x ,^\\J^. Thus' 

(qx\a\q^,qu) = (qx, « f |U#q \q li , q v ) . (44) 

The arguments in the last few paragraphs constitute a proof of the Wigner-Eckart theorem[47], which is 
stated as follows: 
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Theorem 1 (Wigner-Eckart) For any irreducible tensor operator T" = {^} 9i/e gd C Hom(Q^, Q^), 
there exist T a eC /or eac/i a e M*„ sucft tftaf /or aZZ G QjJ, |qr„) G and € 

(«a|T;J«„)= X! r a ( gA ,at|U^|g M ,g„). (45) 

Thus, the action of tensor operators can be related to a component T a that is invariant under Ud and a 
component that is equivalent to the CG transform. We will use this in the next section to derive an efficient 
quantum circuit for the CG transform. 



B. A recursive construction of the Clebsch-Gordan transform 

In this section we show how the Ud CG transform (which here we call U^q) can be efficiently reduced to 

the Ud-i CG transform (which we call U^q 1 '). Our strategy, following [46], will be to express U^q in terms 

ofUd-i tensor operators and then use the Wigner-Eckart Theorem to express it in terms of U^q 1 '. After we 

have explained this as a relation among operators, we describe a quantum circuit for Uqq that uses U^q 1 ' 
as a subroutine. 

First, we express Uqq as a Ud tensor operator. For jj, G \q) G Q d and i G [d], we can expand 

U^| M )k)N)as 

U^| M )k)|i) = |M) E E CtfjvL + efiW). (46) 

je[d] s.t. ?'6Q£ + 

for some coefficients C^' 3 q , G C. Now define operators Tf' 3 : Q d — > Q^ +ej by 

= E E c^ie'x?!. ( 47 ) 



so that U^Q decomposes as 



U^| M >k>|i> = |A*> E \^ + e j )Tt 3 \q). (48) 

j£[d] s.t. 

Thus Ugl can be understood in terms of the maps Tf' 3 , which are irreducible tensor operators in 
Hom(<2^, Qf l+ej ) corresponding to the irrep Qm- (This is unlike the notation of the last section in which 
the superscript denoted the irrep corresponding to the tensor operator.) 

The plan for the rest of the section is to decompose the Tf' 3 operators under the action of Ud-i, so that 
we can apply the Wigner-Eckart theorem. This involves decomposing three different Ud irreps into Ud-i 
irreps: the input space Q^, the output space Qf l+ej and the space Qhs. corresponding to the subscript i. 

Once we have done so, the Wigner-Eckart Theorem gives an expression for Tf' 3 (and hence for U^q) in 

terms of U^q 1 ' and a small number of coefficients, known as reduced Wigner coefficients. These coefficients 
can be readily calculated, and in the next section we cite a formula from [46] for doing so. 

First, we examine the decomposition of Qfiy the W<j-irrep according to which the Tf' 3 transform. Recall 

that = Q^q) 1 © 1 - In terms of the tensor operator we have defined, this means that Tf' 3 is an 

irreducible Ud-i tensor operator corresponding to the trivial irrep Q^ 1 and {Tf' 3 , . . . , T^j^} comprise an 

irreducible Ud-i tensor operator corresponding to the defining irrep QfjT 1 . 

Next, we would like to decompose Hom( Q d , Q^ +e . ) into maps between irreps of Ud- i • This is slightly more 
complicated, but can be derived from the Ud-i C Ud branching rule introduced in Sec. IIIB. Recall that 

Ud-\ Ud-i 

Qfi — 0/i'^/x 2 M ' i an d similarly Qj l+ej = ® f _ t "^ tJ , +ej Q^n ■ This is the moment that we anticipated 
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in Sec. IIIB when we chose our set of basis vectors to respect these decompositions. As a result, a 
vector \q) G can be expanded as q = (qd-i, qd-2, ■ ■ ■ , qi) = (m'> 9(d-2)) witn Qd-i = A 4 ' G m'^A* 
and |g(d— 2)) = kd-2, • • • , Qi) € Qj 1 - In other words, we will separate vectors in into a Ud-\ irrep label 
p! G and a basis vector from Q^7 1 . 

This describes how to decompose the spaces and Q'f l+ej ■ To extend this to decomposition of 
Rom(Qf l ,Qf l+e .), we use the canonical isomorphism Hom(@ x A x , y B y ) = Q) xy Rom(A x ,B y ), which 
holds for any sets of vector spaces {A x } and {B y }. Thus 

Hom(Q^Q^)^" 1 Hom(Q^7\ Qp 1 )- (49a) 

Sometimes we will find it convenient to denote the 2^7 1 subspace of <2^ by <2^7 1 C Q^, so that Eq. (49a) 
becomes 

Hom(Q^, Q^) "s 1 Hom(Q^7 1 C Q% 2^7 1 C Q^.)- (49b) 

According to Eq. (49) (either version), we can decompose T/ IJ as 

7f=E E I^XM'l®^''"". (50) 

Here T^'^" e Hom(Q^7 1 C Q£, 2^7 1 C Q^+e 3 ) and we have implicitly decomposed |g) G into 
|M')k(d-2)>- 

The next step is to decompose the representions in Eq. (49) into irreducible components. In fact, we are 
not interested in the entire space Hom(Q^7 1 , Q^ 1 ), but only the part that is equivalent to or Q^o) 1 , 

depending on whether i G [d — 1] or i = d (since T/ 1 '^ ' M transforms according to Q^ 1 if i € {1, . . . , d — 1} 
and according to Q^q) 1 if * = d). This knowledge of how T?' J '' ,i transforms under Ud-i will give us 
two crucial simplifications: first, we can greatly reduce the range of p" for which Tf 1 ' 3 '^ is nonzero, and 
second, we can apply the Wigner-Eckart theorem to describe T- 1 ' 1 '^ ' p in terms of U^Tj 1 '. 

The simplest case is Q^S when i = d: according to Schur's Lemma the invariant component of 
Hom(Q^7 1 , Q^7 X ) is zero if p! ^ p" and consists of the matrices proportional to Iq<i-i if p! = p" . In 

other words T^'^''^" = unless p' = p", in which case T^ j '^''^ := f ^>''°/ Q<1 -i for some scalar T'w'. . 

(The final superscript will later be convenient when we want a single notation to encompass both the i = d 
and the i G {1, . . . , d — 1} cases.) 

A 1 • I II 

The (2(7) case, which occurs when i G {1, . . . , d — 1}, is more interesting. We will simplify the T i /iJ ' A1 ' M 
operators (for i = 1, ... ,d — 1) in two stages: first using the branching rules from Sec. IIIB to reduce 
the number of nonzero terms and then by applying the Wigner-Eckart theorem to find an exact expression 
for them. Begin by recalling from Eq. (39) that the multiplicity of Q^ 1 in the isotypic decomposition of 

Hom(Q^7 1 , Q^ 1 ) is given by dimHom(Q^7 1 <8> Q^ 1 , Qp 1 ) Ud - 1 . According to the U d - X CG "add a box" 

prescription (Eq. (33)), this is one if p! G p" — □ and zero otherwise. Thus if i G [d — 1], then T^ 3 ' 1 * is 
zero unless p" = p! + ey for some f G [d — 1]. Since we need not consider all possible /i", we can define 
yMJ> ,j ._ yA* j> ,m +ey rp^jg no t a tion can be readily extended to cover the case when i = d; define eo = 0, 
so that the only nonzero operators for i = d arc of the form T£'- 7> ''° := T^'**'^' = f^^'' I Q d-i . Thus, we 
can replace Eq. (50) with 

Tf J = E El/i' + e J ,><M'|®ir J " ,V+V - (51) 

Now we show how to apply the Wigner-Eckart theorem to the i G [d — 1] case. The operators T- 1 ' 3 '^ ' J 
map Q^T 1 to Q^T^g., and comprise an irreducible Ud-i tensor operator corresponding to the irrep Q(T) 1 - 
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This means we can apply the Wigner-Eckart theorem and since the multiplicity of Q^7+ e ., in Q^V 1 <8> Q^ 1 
is one, the sum over the multiplicity label a has only a single term. The theorem implies the existence of a 
set of scalars T^>'<-?' such that for any \q) E Q^T 1 and \q') E Q^7+ e . ( , 

(g/j^V J' \ q) = fujyj' M / + e ^lug- 1 ^', 9, i>. (52) 

Sometimes the matrix elements of Ucg or T^ J ' M J are called Wigner coefficients and the Tv>3>v d are known 
as reduced Wigner coefficients. 

Let us now try to interpret these equations operationally. Eq. (48) reduces the Ud CG transform to a Ud 
tensor operator, Eq. (51) decomposes this tensor operator into d 2 different Ud-i tensor operators (weighted 
by the T MJ,Ai J coefficients) and Eq. (52) turns this into a Ud-i CG transform followed by a d x d unitary 
matrix. The coefficients for this matrix are the T^'i't 1 J , which we will see in the next section can be 
efficiently computed by conditioning on fi and //. 

Now we spell this recursion out in more detail. Suppose we wish to apply U^q to = 
lA t )lA t ')| < Z(d-2))l*): f° r some i E {1, . . . , d — 1}. Then Eq. (52) indicates that we should first apply U^q 1 ' to 
|//) |<7((j_2))|i) to obtain output that is a superposition over states \fjf + e-j')\j') l'^d-2)^ f° r j' G {l,...,Gf — 1} 
and q_\d-2)^} ^ Qfj+ei • Then, controlled by /i and //, we want to map the (d — l)-dimensional \ f) register 
into the d-dimensional \j) register, which will then tell us the output irrep Qf l+ej ■ According to Eq. (52), 
the coefficients of this d x (d — 1) matrix are given by the reduced Wigner coefficients Tv<i,n J ^ so we w jn 
denote the overall matrix , := J2j j> T^^-^ +S' J \j)(j'\. 3 The resulting circuit is depicted in Fig. 4: a 
Ud-i CG transform is followed by the operator, which is defined to be 

fW = E E r"^>></i| ® |M + e^X/i'l ® l/x' + e.vX/x' + e^j- (53) 

Then Fig. 5 shows how can be expressed asadx(rf-l) matrix that is controlled by fi and jj! . In 
fact, once we consider the i = d case in the next paragraph, we will find that , is actually a d x d unitary 
matrix. In the next section, we will then show how the individual reduced Wigner coefficients T^'i^ >■? can 
be efficiently computed, so that ultimately can be implemented in time poly(d, log 1/e). 

Now we turn to the case of i = d. The circuit is much simpler, but we also need to explain how it works 
in coherent superposition with the i E [d — 1] case. Since i = d corresponds to the trivial representation of 

Ud-i, the Uqq 1 ' operation is not performed. Instead, and \q(d-2)) are left untouched and the \i) = \d) 
register is relabeled as a \f) = |0) register. We can combine this relabeling operation with U^q 1 ' in the 
i E [d—1] case by defining 



frld-l] 
U GG 



j \ 

|0)<d|® ]T WW\ 



(54) 



This ends up mapping i E {1, . . . ,d} to j' E {0, . . . ,d — 1} while mapping Q^, to Q M / +e ., • Now we can 
interpret the sum on f in the above definitions of and T' ' , as ranging over {0, so thatfj^, 

is a d x d unitary matrix. We thus obtain the circuit in Fig. 4 with the implementation of depicted in 
Fig. 5. 

We have now reduced the problem of performing the CG transform U^q to the problem of computing 
reduced Wigner coefficients T M,A1 J . 



3 The reason why jx' + e^i appears in the superscript rather than // is that after applying ' , wc want to keep a record of 
fi' + eji rather than of fi' . This is further illustrated in Fig. 5. 
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FIG. 4: The lid CG transform, Uqq, is decomposed into a Ud-i CG transform Uq G 1 ' (see Eq. (54)) and a reduced 
Wigner operator T^ d \ In Fig. 5 we show how to reduce the reduced Wigner operator to a d x d matrix conditioned 
on fx and ii! + ey . 



Im>- 



Im) — 
Im'> — 



\ej) -ffi- \n + ej) 
\t*' + ej') 



FIG. 5: The reduced Wigner transform f [d] can be expressed as a d x d rotation whose coefficients are controlled by 
\x and u' + eji . 



C. Efficient Circuit for the Reduced Wigner Operator 

The method of Biedenharn and Louck[46] allows us to compute reduced Wigner coefficients for the cases 
we arc interested in. This will allow us to construct an efficient circuit to implement the controlled-T operator 
to accuracy e using an overhead which scales like poly(logn, d, log(e _1 )). 

To compute T^'i'V , we first introduce the vectors jj, := A t +Sj=i( c ^ — j) e j an< ^ m' := M'+X^i 1 — j) e j- 
Also define S(j — f) to be 1 if j > j' and —1 if j < j' . Then according to Eq. (38) in Rcf [46], 




Hseid-i]\j(H-i*'s)llteid]\j>(P-' j '-P't + 1 ) 
n 3e [d]\ 3 (Aj-A' s )rite[ c i-i]\ 3 '(Aj/-Ai + 1 ) 

n s£ [d-l]V,(£;)-£s) 



ifj'e {i,...,d-i}. 
if/ = o. 



(55) 



The elements of the partitions here are of size 0(n), so the total computation necessary is poly (d, log n). 
Now how do we implement the transform given this expression? 

As in the introduction to this section, note that any unitary gate of dimension d can be implemented 
using a number of two qubit gates polynomial in d[41-43]. The method of this construction is to take a 
unitary gate of dimension d with known matrix elements and then convert this into a series of unitary gates 
which act non-trivially only on two states. These two state gates can then be constructed using the methods 
described in [42] . In order to modify this for our work, we calculate, to the specified accuracy e, the elements 
of the operator, conditional on the fi and fi' + ej> inputs, perform the decomposition into two qubit gates 
as described in [41, 42] online, and then, conditional on this calculation perform the appropriate controlled 
two-qubit gates onto the space where will act. Finally this classical computation must be undone to 
reset any garbage bits created during the classical computation. To produce an accuracy e we need a classical 
computation of size poly (log (1/e)) since we can perform the appropriate controlled rotations with bitwise 
accuracy. 

Putting everything together as depicted in figures 4 and 5 gives a poly (d, log n, log 1/e) algorithm to reduce 
Ugfj to U^ G 1] . Naturally this can be applied d times to yield a poly (d, log n, log 1/e) algorithm for U^q. 
(We can end the recursion cither at d = 2, using the construction in [48], or at d = 1, where the CG transform 
simply consists of the map fi — > fi + 1 for fi e Z, or even at d = 0, where the CG transform is completely 
trivial.) We summarize the CG algorithm as follows. 



Algorithm: Clebsch-Gordan transform 
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Inputs: (1) Classical registers d and n. (2) Quantum registers |A) (in any superposition over 
different A e Id,n), \q) € Qf (expressed as a superposition of GZ basis elements) and \i) G C d . 
Outputs: (1) Quantum registers |A) (equal to the input), \j) e C d (satisfying X+ej e Id n+ \) 
and|g')eQ( +e .. 

Runtime: d 3 poly(logn, log 1/e) to achieve accuracy e. 
Procedure: 

1. If d = 1 

2. Then output |j) := |i) = |1) and \q') := \q) = |1) (i.e. do nothing). 

3. Else 

4. Unpack \q) into |<7(d-2)), such that // e T dim , m < n, h'^/j, and |g( d _ 2 )) € Q^ 1 . 

5. Ifi<d 

6. Then perform the CG transform with inputs (rf— l,m, |g(d— 2)) ; ]*)) and outputs 

(lAIA^-a)))- 

7. Else (if i = d) 

8. Replace |i) = \d) with \f) := |0) and set 9( d _ 2 )) := 9(d-2))- 

9. End. (Now ie {1, . . . , d} has been replaced by j e {0, . . . , d — 1}.) 

10. Map |//>b"> to \ fi ' + e r )\j'). 

Conditioned on /x and // + e^, calculate the gate sequence necessary to implement T^ d \ 
which inputs \j') and outputs \j). 

12. Execute this gate sequence, implementing T^. 

13. Undo the computation from 11. 

14. Combine |// + ey) and 9( d _ 2 )) to f° rm I?')- 

15. End. 

Finally, in Sec. IV we described how n CG transforms can be used to perform the Schur transform, so 
that Usch can be implemented in time n ■ poly (d, log n, log 1/e), optionally plus an additional poly(n) time 
to compress the \p) register. 



VI. CONCLUSION 



We have taken on the challenge of implementing a circuit which performs the Schur transform. This 
transform, used ubiquitously[19-30] in quantum information theory, represents an important new transfor- 
mation for quantum information science. The key ingredients in the construction of this circuit were the 
relationship between Wigner operators and reduced Wigner operators and an efficient classical algorithm 
for the calculation of the matrix elements of the reduced Wigner operators. This extends our construction 
from [48] where we constructed the Schur transform for n qubits (d = 2). Our construction has a running 
time which is polynomial in dimension, d, number of qudits, n, and accuracy, log(l/e. We have thus made 
practical the large set of quantum information protocols whose computational efficiency has, prior to our 
work, been uncertain. 

For some applications, it is not necessary to perform the full Schur transform, but instead to only be able 
to perform a projective measurement onto the different Schur subspaces. In part II, we consider a quantum 
circuit, based on Kitaev's phase estimation algorithm [49], for this task. We further generalize this algorithm 
to a circuit which is applicable to any nonabelian finite group. Our algorithm is efficient if there exists an 
efficient quantum circuit for the Fourier transform over this group[33, 50] and represents an ideal way to 
efficiently deal with situations where quantum states possess symmetries corresponding to some finite group. 
Further in part II we discuss relationships between the Schur transform and the Fourier transform over the 
symmetric group. 

Finally, we will conclude with some open problems suggested by our construction of the Schur transform. 
The first interesting question which arises from our work is whether Clebsch-Gordan transforms for other 
groups can be efficiently constructed. We suspect that for many finite groups, even when dealing with 
representations which are of dimension d, that their Clebsch-Gordan transforms can be constructed using 
circuits of size polynomial in log(d). Our intuition for this claim comes from the construction of quantum 
Fourier transforms over finite groups [33, 50]. A second question is that while the Schur transform is used 
frequently in quantum information theory, it has thus far not seen used in the field of quantum algorithms. 
Kuperberg's[51] subexponential algorithm for the dihedral hidden subgroup problem makes use of the effect 
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of the Clebsch-Gordan series for the dihedral group. Does the Clebsch-Gordan series for Ud produce any 
similar speedup for the appropriately defined hidden subgroup problem on 
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